(54) Polypeptides and nucleic acids encoding the same for the diagnosis and prevention of
tuberculosis disease.

(57) The invention relates to an isolated nucleic acid of M. tuberculosis containing in its nucleotide sequence at least two overlapping ORFs which encode the following two antigenic proteins, recognized by a pool of sera from tuberculous patients:

(i) B2 protein represented by SEQ ID NO 2, or a protein homologous to B2 containing in its amino acid sequence at least one conservative amino acid substitution (according to Table 1) with respect to the sequence represented by SEQ ID NO 2, and

(ii) B3 protein represented by SEQ ID NO 3, or a protein homologous to B3 containing in its amino acid sequence at least one conservative amino acid substitution (according to Table 1) with respect to the sequence represented by SEQ ID NO 3,

or a fragment of said nucleic acid, said fragment containing at least 15, preferably at least 30 and more preferably at least 45 contiguous nucleotides of the sequence of said nucleic acid, and encoding at least one epitope of the B2 and/or B3 protein, or its homologues.
The current invention relates to an isolated nucleic acid molecule of M. tuberculosis, encoding several antigenic [...] encoded by said nucleic acid, methods for producing the same, and their use [...] for the diagnosis of tuberculosis disease in mammals. The invention also relates to a vaccine [...] disease in mammals, comprising as an active principle [...].



BACKGROUND OF THE INVENTION



population. Its high morbidity (8 million new patients a year) and mortality (3 million deaths a year) rates witness the virulence of the etiological agents (M. tuberculosis and M. bovis).

[...] bacilli are disseminated by post-primary tuberculosis patients with open lesions (Daniel, 1994; [...]). The diagnostic procedures are unsatisfactory. Culturing the bacilli [...] from a sample taken from the patient requires very long incubation times, while only mycobacteria with intact envelopes are [...].

[...] However, commercially available serological methods lack species-specificity, being unable to distinguish tuberculous bacilli from the other pathogenic and saprophytic mycobacteria [...] homologies exist among the corresponding proteins of different mycobacterial species.

Numerous proteins of M. tuberculosis [...] have been identified as immunologically significant for diagnosis [...] recognized by patients' sera and/or eliciting a cellular immune response. Most of them [...] growing bacteria and thus directly interacting with the host immune system. Reference can be made to the review article of Young et al. (1992) for the mycobacterial proteins described before 1992. Some more recently [...] 16 (Verbon et al., 1992; Fnscia et al., 1995), 18 (Manca et al., 1997), 19 (Hams et al., 1994; Prestidge et al., 1995), 25 [...], 28 (Berthet et al., 1995) [...] been characterized, and some of them have been used on an experimental scale for immunoassays. The heat-shock [...] 71 kDa of the tuberculous bacilli [...] (Young et al., [...]).

[...] recombinant proteins biochemically characterized, those of 6 kDa (ESAT-6) [...], 19 (Prestidge et al., 1995), 22 and 25 kDa (MPB83 and 70) (Hewinson et al., 1996; Matsumoto et al., 1995), 28 kDa (erp) (Berthet et al., 1995; Bioi et al., [...]), [...] 30-32 kDa (Ag 85 A-B-C) ([...]), 32 kDa (Ag 84) (Hermans et al., 1995), 38 kDa (Singh et al., 1992), ought to be cited. Most of these proteins cross-react with the corresponding proteins of mycobacteria other than M. tuberculosis. None of them has been used so far [...] large scale [...].

[...] indispensable. Specificity of the assay has to be increased by selecting peptides which [...] with sera from M. tuberculosis (M. bovis) infected subjects and not with sera from healthy subjects or subjects infected by mycobacteria other than M. tuberculosis. There remains a [...] need [...] M. tuberculosis epitopes useful in immunodiagnosis [...].

AIMS 

[...] the current invention to provide new [...] antigens useful in the immunodiagnosis of [...].

[0007] More specifically, it is an aim of the current invention to provide new M. tuberculosis polypeptides containing species-specific epitopes, useful in the serodiagnosis of tuberculosis disease.

[0008] It is moreover an aim of the current invention to provide the nucleic acids encoding said polypeptides. The [...] translation products is also [...] the current invention.

[0009] It is also an aim of the current invention to provide methods for producing said M. tuberculosis polypeptides.

[0010] It is moreover an aim of the current invention to provide methods and kits for immunodiagnosis of tuberculosis disease in mammals.

[0011] Finally, it is an aim of the current invention to provide for a vaccine composition [...].

[0012] It is believed that all the above-mentioned aims have been met by the different embodiments of the current invention.

DEFINITIONS

[0013] The term "nucleic acid" or "polynucleic acid" refers to a single stranded or double stranded nucleic acid molecule which may contain from 10 nucleotides to the complete nucleotide sequence (such as for instance 20, 30, 40, 50, 70, 80 or more nucleotides). A nucleic acid which is smaller than about 100 nucleotides in length is often also referred to as an oligonucleotide. A nucleic acid may consist of deoxyribonucleotides (DNA) or ribonucleotides (RNA), nucleotide analogues or modified nucleotides, or may have been adapted for therapeutic purposes. Nucleic acids are represented throughout the current invention by single stranded DNA sequences, extending from the 5' end to the 3' end. It should be understood however that the nucleic acids of the current invention also include the complementary strand of the illustrated molecule, as well as the double stranded molecules and the corresponding RNA molecules wherein the deoxyribonucleotides are replaced by ribonucleotides, and wherein T is replaced by U.
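The relationship between an illustrated single stranded DNA sequence, its complementary strand and the corresponding RNA molecule can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, and the example input is simply the first four codons of the ORF1 coding region as printed in SEQ ID NO 1.

```python
# Illustrative sketch only; the helper names below are hypothetical.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written from its own 5' end to its 3' end."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def to_rna(dna: str) -> str:
    """Return the corresponding RNA sequence, in which T is replaced by U."""
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    fragment = "ATGACCACCCCT"  # ATG ACC ACC CCT, start of ORF1 in SEQ ID NO 1
    print(reverse_complement(fragment))  # complementary strand, 5' to 3'
    print(to_rna(fragment))              # RNA counterpart of the shown strand
```

Either representation carries the same information as the printed strand, which is why the listing only needs to show one strand.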
[0014] The expression "isolated" means that the respective component (nucleic acid, protein) is [...] most preferably 98% pure as measured by its weight versus the weight of possible contaminants. For [...] of the invention, the purity is high enough in order to ensure a specific reactivity of the [...].
for amino acids, without the occurrence of a termination codon. The sequence of an ORF is (potentially) translatable 
[0016] The term "polypeptide" designates a linear series of amino acids connected one to the other by peptide bonds between the alpha-amino and carboxy groups of adjacent amino acids. In the context of the present invention, the term "polypeptide" is used interchangeably with the term "protein". Polypeptides can be in a variety of lengths [...] ([...] amino acids in length or shorter). [...] occur either in their natural (uncharged) forms or in forms which are salts, and either free of [...] glycosylation, side chain oxidation, or phosphorylation or containing these modifications. It is well understood in the art that amino acid sequences contain acidic and basic groups, and that the particular ionization state exhibited by the polypeptide is dependent on the pH of the surrounding medium when the protein is in solution, or that of the medium from which it was obtained if the protein is in solid form. Also included in the definition are proteins modified by additional substituents attached to the amino acids side chains, such as glycosyl units, lipids, or inorganic ions such as phosphates, as well as modifications relating to chemical conversions of the chains, such as oxidation of sulfhydryl groups. Thus, the polypeptides of the invention also include the appropriate amino acid sequence referenced subject to those of the foregoing modifications which do not destroy its functionality (i.e. immunoreactivity).
[0017] The polypeptides of the invention, and particularly the shorter peptides, can be prepared by classical chemical synthesis. The synthesis can be carried out in homogeneous solution or on solid phase.

[0018] For instance, the synthesis technique in homogeneous solution which can be used is the one described by Houben-Weyl in the book entitled "Methode der organischen Chemie" (Method of organic chemistry) edited by E. Wunsch [...].

[...] Atherton and Shepard in their book entitled "Solid phase peptide synthesis" (IRL Press, [...]).
[...] The polypeptides according to this invention can also be prepared by means of recombinant DNA techniques [...] (Molecular Cloning: A Laboratory Manual, New York, Cold Spring Harbor Laboratory, [...]).

[...] that the polypeptide contains antigenic regions which induce a [...] formation of antibodies or a cellular immune response in the host organism which has been infected by [...]. The immune response may be either of the humoral type (raise in antibody titer) or of the cellular type (specific lymphoproliferation, secretion of certain lymphokines...).
The term "serodiagnosis" refers to the diagnosis of a disease by the detection in the serum of a raised titer of [...]

J^et^ 

ratio of "true positives" over the sum of "true positives" and "false negatives".
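The ratio described above, true positives over the sum of true positives and false negatives, is the usual definition of the sensitivity of a diagnostic test. A minimal sketch of the calculation follows; the function name and the counts in the example are hypothetical and are not data from this document.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Ratio of true positives over (true positives + false negatives)."""
    diseased = true_positives + false_negatives
    if diseased == 0:
        raise ValueError("at least one diseased subject is required")
    return true_positives / diseased


if __name__ == "__main__":
    # Hypothetical panel: 45 patient sera score positive, 5 score negative.
    print(sensitivity(45, 5))
```

The value depends only on subjects who actually have the disease, so it says nothing about how often healthy subjects are wrongly scored positive.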

Esp?^ 

in the art. [...] may be determined by any of the techniques known [...] contains usually a stretch of at least [...] 14, 15, 16, 17, 18, 19, 20, 21, [...], 24, 25, 30, 50 contiguous amino acids.

[...] refers to an epitope [...] be recognized by antibodies [...] but which will not be recognized by antibodies raised against other types of mycobacteria. Species specificity may be [...] in laboratory animals, such as the technique used in the examples section [...] also be evaluated [...] antibodies [...].

[...] man and bovine. It should be clear that wherever the term [...] belonging to the M. tuberculosis [...] and thus includes at least the following species: M. tuberculosis, M. bovis, M. africanum [...].

[...] which substitutions are, generally, conservative [...].





TABLE 1

Overview of conservative amino acid substitutions.

Amino acid    Conservative substitution by
Ser (S)       Thr, Gly, Asn
Arg (R)       His, Lys, Glu, Gln
Leu (L)       Ile, Met, Phe, Val, Tyr
Pro (P)       Ala, Thr, Gly
Thr (T)       Pro, Ser, Ala, Gly, His, Gln
Ala (A)       Pro, Gly, Thr
Val (V)       Met, Ile, Tyr, Phe, Leu
Gly (G)       Ala, Thr, Pro, Ser
Ile (I)       Met, Leu, Phe, Val, Tyr
Phe (F)       Met, Tyr, Ile, Leu, Trp, Val
Tyr (Y)       Phe, Trp, Met, Ile, Val, Leu
Cys (C)       Ser, Thr, Met
His (H)       Gln, Arg, Lys, Glu, Thr
Gln (Q)       Glu, His, Lys, Asn, Thr, Arg
Asn (N)       Asp, Ser, Gln
Lys (K)       Arg, Glu, Gln, His
Asp (D)       Asn, Glu, Gln
Glu (E)       Gln, Asp, Lys, Asn, His, Arg
Met (M)       Ile, Leu, Phe, Val
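Table 1 can be used as a simple lookup when deciding whether a variant sequence differs from a reference only by conservative substitutions. The sketch below transcribes the table with one-letter codes and compares two sequences of equal length position by position. The function names are hypothetical, the reference in the example is the first 13 residues of SEQ ID NO 2, and the variant is invented purely for illustration. Note that the table is directional (for instance Cys may be replaced by Ser, but Ser is not listed as replaceable by Cys), and the lookup preserves that.

```python
# Table 1 in one-letter code: key = original residue,
# value = residues listed as conservative substitutions for it.
_TABLE_1 = {
    "S": "TGN", "R": "HKEQ", "L": "IMFVY", "P": "ATG", "T": "PSAGHQ",
    "A": "PGT", "V": "MIYFL", "G": "ATPS", "I": "MLFVY", "F": "MYILWV",
    "Y": "FWMIVL", "C": "STM", "H": "QRKET", "Q": "EHKNTR", "N": "DSQ",
    "K": "REQH", "D": "NEQ", "E": "QDKNHR", "M": "ILFV",
}
CONSERVATIVE = {aa: frozenset(subs) for aa, subs in _TABLE_1.items()}


def is_conservative(original: str, replacement: str) -> bool:
    """True if Table 1 lists `replacement` as a substitution for `original`."""
    return replacement in CONSERVATIVE.get(original, frozenset())


def classify_substitutions(reference: str, variant: str):
    """List (position, reference residue, variant residue, conservative?)
    for every position at which two equal-length sequences differ.
    Positions are 1-based."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (pos, ref, var, is_conservative(ref, var))
        for pos, (ref, var) in enumerate(zip(reference, variant), start=1)
        if ref != var
    ]


if __name__ == "__main__":
    reference = "MTTPSHLSDRYEL"  # first 13 residues of SEQ ID NO 2
    variant = "MTSPSHLSDKYEL"    # hypothetical variant, for illustration only
    for row in classify_substitutions(reference, variant):
        print(row)
```

A variant would qualify under the table only if every reported difference is flagged as conservative.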



DETAILED DESCRIPTION OF THE INVENTION

[...] from tuberculous patients: [...]

[...] B2 and/or B3 protein, or its homologues.

[...]
[0033] The amino acid sequence of protein B2 (SEQ ID NO 2) [...] 626 [...] product of ORF1 [...].

[...] at least 45 contiguous nucleotides of the sequence [...] responsible for its immunodominant character (Cocito et al., [...]) [...] owing to the extensive cross-[...] of the A60 complex which contain [...] DNA segment including the two genes and their regulatory [...] synthesis of proteins B2 and B3. The [...]

(iii) a nucleotide sequence which [...]

[...] fragments of any of [...] protein, or a [...]

- from position (71) to position (1951)

- from position (398) to position (1951)

- from position (1063) to position (2604)

- from position (272) to position (1438)

- from position (1439) to position (1951)

- from position (1438) to position (2604)

- the nucleic acid fragment (71)-(1951) encodes the B2 protein starting from the first presumable start codon,

- the nucleic acid fragment (398)-(1951) encodes the B2 protein in its shorter form, starting from the second presumable start codon,

- the nucleic acid fragment (1063)-(2604) encodes the B3 protein,

- the nucleic acid fragment (272)-(1438) encodes the a390 fragment of the B2 protein described in the examples section,

- the nucleic acid fragment (1439)-(1951) encodes the a171 fragment of the B2 protein described in the examples section.
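The fragment coordinates listed above can be checked with a little arithmetic: the length of a fragment is end - start + 1, and two positions lie in the same reading frame when they are congruent modulo 3. The sketch below is illustrative only. The helper names are hypothetical; the ORF start positions (71 and 1063) are taken from the sequence listing; the coordinates of the a390 fragment are read from a partly legible passage; and the label "a389" for fragment (1438)-(2604) is an assumption based on the examples section.

```python
# Start positions of the two coding regions, as given in the sequence listing.
ORF_START = {"ORF1 (B2)": 71, "ORF2 (B3)": 1063}

# (start, end) positions on SEQ ID NO 1, 1-based and inclusive.
# The a390 coordinates and the "a389" label are assumptions (see lead-in).
FRAGMENTS = {
    "B2, first presumable start codon": (71, 1951),
    "B2, second presumable start codon": (398, 1951),
    "B3": (1063, 2604),
    "a390": (272, 1438),
    "a171": (1439, 1951),
    "a389": (1438, 2604),
}


def length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive 1-based interval."""
    return end - start + 1


def frame(position: int) -> int:
    """Reading frame (0, 1 or 2) of a 1-based position."""
    return (position - 1) % 3


def orfs_in_frame(start: int) -> list:
    """Names of the ORFs whose start codon is in frame with `start`."""
    return [name for name, s in ORF_START.items() if frame(s) == frame(start)]


if __name__ == "__main__":
    for name, (start, end) in FRAGMENTS.items():
        n = length(start, end)
        print(f"{name}: {n} nt, {n // 3} codons, in frame with {orfs_in_frame(start)}")
```

Under these assumptions, fragments (1439)-(1951) and (1438)-(2604) start one nucleotide apart and therefore fall in different reading frames, which is consistent with the description of two overlapping ORFs.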
well known in the art of recombinant [...] encoded by any of the nucleic



[0050] .-- - 

20 lowing amino acid sequences: 
tuberculous patients. [...]



[0051] — 

ing amino acid sequences 
GQPPSRRRGDRHQSTRR (G-17-R peptide), in the B2 (resp. B3) protein. The positions of both peptides are indicated on the sequence [...].

[...] for a fusion protein comprising an amino acid sequence of a polypeptide as [...].

[0054] The [...] polypeptides of the invention as fusion proteins [...] advantages, such as e.g. enhanced [...] purification due to [...] of the polypeptide [...] remain attached to [...]

steps are required: [...] in which a nucleotide sequence
[0056] The techniques for carrying out [...]

[...] of the hybridoma to produce [...] a myeloma [...] against the polypeptides [...]
[0060] The monoclonal antibodies according to [...] which have been initially used for [...] for H and L chains or from cDNA [...] departing from the mouse [...]

[0061] Also fragments derived from [...] monoclonal antibodies such as [...] fragments, providing they [...] H and L chains. [...] are commonly generated by [...] proteases. It is well known to the person skilled in the art that monoclonal antibodies, or fragments thereof, can be modified for various uses.

[0062] The antibodies of the current [...]

animal.

[...] antibodies or antigens in a sample, is subject to a great deal of variation, and many formats are known in the art. Most assays involve the use of labeled antibody and/or polypeptide, [...] phosphatase), fluorescent, chemilu[...]

[...]

[0073] In an antibody sandwich ELISA, the antibodies [...] reacting with the bound [...]

[...] described above, wherein said method or assay is a [...] immunodiagnostic kit for the in vitro detection

[0075] According to another embodiment [...] of tuberculosis disease in mammals, said kit comprising a polypeptide as [...] the invention has been readily immobilized.

[0078] The solid support onto which the polypeptides of the invention are coated may be chosen from the supports commonly used in the art of immunodiagnosis. [...] support may be a polystyrene [...] a membrane (LIA (Line Immuno Assay)).

antibodies in a biological sample liable to contain them, comprising:

[...] defined above, preferentially in combination with other polypeptides or peptides from M. [...]

[...] an automated scanning and interpretation device for inferring the presence of M. tuberculosis antibodies in the sample from the detected signal.

[0080] According to another embodiment, the [...] M. tuberculosis antigens in a sample, said kit comprising an [...]

[0081] According to yet another embodiment the [...] method using at least one of the polypeptides of the invention, possibly in combination [...] a cellular immune response in an individual which has been in [...] pathogen, said cellular immune [...] subcutaneous injection [...]
described above, or a nucleic acid [...] or a fusion protein as [...] component or any other bacterial or other immunogenic component in general.



FIGURE LEGEND 
[0087] 



[...] electrophoresis (12% SDS-PAGE), and transferred to nitrocellulose membranes [...] (lane B), or incubated with a pool of tuberculous patients sera (lane C).

Figure 2. [...] reading frames of 1878 (ORF1) and 1539 (ORF2) nucleotides [...] ribosomal [...] are indicated in bold characters. Promoter-like [...] genes encode proteins of 626 and 513 aa respectively, whose deduced amino acid sequences are displayed. The lower amino acid backbone represents the protein B2 (SEQ ID NO 2). The upper [...] protein B3 (SEQ ID NO 3). [...] indicates the first amino acid [...] the ZAP-Tb22 [...] EcoRI and SalI restriction sites [...]. The fragments a390, a171 and a389 which have been expressed in E. coli are indicated with [...].

Figure 3. Schematic diagram of subclones used for expression and/or sequencing of portions of the b2 and b3 [...]

[...] of pmTNFIIXH, yielding the [...] of the pL lambda promoter. The insert of the plasmid is shown in shaded [...]

(e) portion of the Tb3 clone (PstI-SalI) carrying the upstream region and presumptive starting codons of the [...]

(f) position of the region of which the nucleic acid sequence is illustrated in figure 2.

[...] 12% SDS-PAGE, transferred to nitrocellulose membranes, and reacted with [...] preabsorbed (lane B') or not (lane B) with a M. avium lysate. Lane A': lysate of [...] mycobacterial insert (negative control). Lane A: molecular mass standards.
Figure 5. Western blot analysis of the [...] pmTNF.XH.a389 [...]

molecular mass standards.

Figure 6. Electrophoretic analysis of the proteins expressed by recombinant [...] pmTNF.XH.a171. The inclusion bodies fraction [...]

(B) inclusion bodies.

Figure 7. Comparative serological [...] healthy (BCG vaccinated and non-vaccinated) and tuberculosis patients sera by the A60 and the a171 [...]. [...] values (A [...] nm) are reported on the ordinate axis. The horizontal lines represent the [...] values [...]. Different [...] of tuberculous (tbc) patients sera were considered: tbc post-primary (= patients [...] infection), tbc miliary (= patients with disseminated form of the disease), [...] from which tuberculous bacilli have been isolated from the cerebrospinal fluid), tbc sera mixed (= [...] tuberculous infection of different organs, besides a pulmonary tuberculosis).

Figure 8. [...] 455 in the B2 protein sequence as shown in figure 2.

Figure 9: Hydrophil[...] combined methods of Parker et al. 1986 [...] in black. Position 1 in this figure corre[...]

Figure 10: Map of [...] fusion protein with the first 25 amino acids of mTNF. The [...] pmTNFIXH except for the sequence of the polylinker [...] or two [...] additional nucleotides, allowing the translation of the [...] reading frame [...] of an inserted nucleic acid molecule. A detail of the nucleotide sequence of the polylinker site for [...] is given below:
polylinker site of pmTNFIXH:

COG CAG ACA CCA TCA CCA CCA TCA CTA ATA GTJ^M S^Ls a* 

W^t^tVypBti^AgrZ taa ... 

(SEQ ID NO 7) 



polylinker site of pmTNFIIXH 

•^GGG.S 

Stul " ; ~p mi j 



AGG GCA GAC ACC ATC ACC ACC ATC ACT AAT AGIJMJJA^cT^SS 

Pad *™ 

0*^^ CG a£^pc5AG*AAG TAAGTAA... 
(SEQ ID NO 8) 



^P"' StU . Pmll 



polylinker site of pmTNFIIIXH: 

Stul "Pmll 

ATC GAG GGC AGA CAC CAT CAC CAC CAT CAC TAA TAG TTA ATT^ftk 

Pad 

S5a! MSP f ^^£C GS! ^A»AA GTA AGT AA ... 
(SEQ ID NO 9) 
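As a quick check of the legible part of the pmTNFIIIXH polylinker, the twelve codons that can be read from the line above (ATC GAG GGC AGA CAC CAT CAC CAC CAT CAC TAA TAG) can be translated with the standard genetic code. The sketch below is illustrative: the helper names are hypothetical and only the clearly readable codons are used. Under that assumption the translation shows six consecutive histidine codons followed by two stop codons.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon ('*' marks a stop codon).
    Whitespace is ignored; trailing bases that do not fill a codon are dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Legible codons of the pmTNFIIIXH polylinker (part of SEQ ID NO 9).
    polylinker = "ATC GAG GGC AGA CAC CAT CAC CAC CAT CAC TAA TAG"
    print(translate(polylinker))
```

Prepending one or two nucleotides to the same string before translating shows how a small insertion shifts an inserted fragment into another reading frame.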
EXAMPLES

I. Materials and methods



[...]

[...] and SG4044 [...]
[0091] Polyclonal rabbit antibodies against [...]

[...] atypical mycobacteria. Sera diluted (1/50) in TBS [...] were incubated overnight at 4°C with [...]

[...] Kesel et al. 1998). Purification of phage [...]. The boiling minilysate method (Holmes [...]

Screening [...] biotinylated DNA probe Tbbiot1 (5'-b-GCG[...] (SEQ ID NO 6)) and nylon membranes (Hybond-N+, Amersham, Buckinghamshire, UK).

[...] appropriate restriction fragments of cloned [...] obtained with the M13 forward and reverse primers.

[0096] Expression of ORFs by [...] introduced in E. coli SG4044 by transformation. Single transformed colonies were grown at 28°C in [...] tetracycline to an A600 of 0.2 and then heat shocked (4 h, 42°C). Cell lysates were mixed with SDS gel loading [...] and analyzed by western blot.

[0097] Purification of a171 polypeptide. E. coli [...] pmTNF.a171 plasmid, were concentrated by centrifugation and lysed by the [...]. Inclusion bodies were pelleted by centrifugation, washed with Triton X-100 and EDTA, and solubilized [...] urea 8M ([...] et al., 1989).
[0098] SDS-PAGE and Western [...] cell lysates, or a171 polypeptide) in gel loading buffer, were boiled [...]. Protein fractionation was carried out at a [...] electrophoresis unit (gel 16 by 18 cm, [...] Scientific Instruments, San Francisco, Cal.). [...] transblotted (LKB 217 Multiphor 2 [...] 0.039 M glycine and 0.048 M Tris base pH [...])
[...] 42°C when lysis plaques were just seen, [...] filters saturated [...] isopropyl-β-D-thiogalactopyranoside (IPTG), and incubated overnight [...] gelatin, and then for 12 h with a pool of 5 tuberculosis patients sera diluted (1/20) in TBST buffer [...] were revealed with an [...].

[...] Plaque-purified recombinant phages were plated in soft agar [...] incubated for 4 h at [...] and then one night at 37°C. Approximately 10 plaque [...] (0.125 M Tris [...]

[...] of tuberculosis patients sera: 21 strongly reacting clones were selected out of some 500 000 [...] were retested [...] anti-A60 polyclonal serum: some 16 positive clones were selected and [...] products were [...] with the same pool of tuberculosis patients sera preabsorbed [...] M. avium and M. leprae strains). One phage (ZAP-Tb22) [...] strongly reacting [...] patients' sera and specific for M. tuberculosis, with respect to the other mycobacterial [...] proteins from the plaques of the recombinant phage showed two bands [...] reacting with tuberculosis patients sera (Fig 1).
[0102] Analysis of the Tb22 region of recombinant [...]

[...] Recombinant phage ZAP-Tb22 carrying [...] DNA segment [...] EcoRI endonuclease, yielding a DNA insert of about 3,700 bp (Tb22) as determined by [...] electrophoresis. The DNA sequence of [...] was determined (see figures 2 and 3).
[0103] The expression of Tb22 was analysed by subcloning [...] restriction fragments of the insert in the members of an expression vector family (pmTNFXH [...] vectors [...] in figure 10. These constructs allow [...] 3 reading frames, as N-terminal fusion proteins with the first 25 amino acids of mouse tumor necrosis [...] under the control of the pL lambda promoter (see also Gilot et al. 1993). Three of the analysed [...] to be interesting. Thus, a [...] DNA fragment, subcloned in vector [...] (pmTNF.XH.a390) which [...] the initial part of a gene comprised [...]. The adjacent [...] EcoRI segment subcloned in the same vector (plasmid pmTNF.XH.a171) [...] further portion ([...] bp) of the same gene. However, this same DNA fragment, upon insertion in vector [...] (pmTNF.XH.a389), which allows expression of a different reading frame on the [...], expressed a 1167 bp product, forming part of a second gene comprised in the Tb22 clone (gene b3).

[...] fragments in the recombinant strains. [...] different overlapping reading frames ORF1 and ORF2 [...] figures 2 and 3. The subsequent examples will demonstrate that fragments of both proteins [...] pool of sera, and contain species specific epitopes useful in serodiagnosis [...].

III. Characterization and expression of the [...]
[0105] Base composition and expression of the [...]
[...] 990 Da in [...]

absorption proved the B2 protein to [...] predicted antigenic peptide Q-17-R was synthesised. Antibodies [...] recombinant cells, [...] DNA sequence. These data indicate [...] epitopes was [...] described in example 1.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: Innogenetics N.V.

(E) COUNTRY: Belgium

(F) POSTAL CODE (ZIP): B-9052

(ii) TITLE OF INVENTION: Polypeptides and nucleic acids encoding the same for the diagnosis and prevention of tuberculosis disease.

(iii) NUMBER OF SEQUENCES: 9

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release [...], Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: EP 98870127.2

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2708 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 71..1951

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 71..1948

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1063..2599

(ix) FEATURE:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 1063..2596

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GACTGCAAAC CTGGGTTGTC CCGCCGACAC CGCATTCGCG CGCGTCGCTG GCCCGATACG 60
AGATAGCGGA ATG ACC ACC CCT TCC CAG CTG TCC GAC CGC TAC GAA CTT
50



Met Thr Thr Pro Ser His Leu Ser Asp Arg Tyr Glu Leu 

GGC GAA ATC CTT GGA TTT GGG GGC ATG TCC GAG GTC CAC CTG GCC CGC 
Sy S£'K Gly Phe Gly Gly Met Ser Glu Val His Leu Ala Arg 

20 ^ 5 



GAC CTC CGG TTS CAC CGC GAC SIT GCG 5TC Mfi GTG CTG CGC GOT GAT 
Arg Leu His Arg Asp Val Ala Val Lys Val Leu Arg Ala Asp 
30 35 
10 CT - sec csc sat CCC AGT TTT TAG CTT CGC TTC CGG CGT GAG GCG CAA 

£a Arg Asp Pro Ser Phe Tyr Leu Arg Phe Arg Arg Glu Ala Gin 

„_ „_ -_ G ttg AAC CAC CCT GCA ATC GTC GCG GTC TAG GAC ACC 

E£ S Sa aS "u £n Hxs Pro Ala He Val Ala Val Tyr Asp Thr 
15 65 ^ 

S S K Si S S SSSKS S S S S 

80 85 

S S S S'S S = 5 S.S S E S S SJ S 

95 100 105 

ATG ACG CCC AAA CGC GCC ATC GAG GTC ATC GCC GAC GCC TGC CAA GCG 
Met Thr Pro Lys Arg Ala lie Glu Val lie Aia as P ~x y ^ 
110 US 

S S£ S 52 ='S S'S SSSSSSJS 
5SJSSS2SSSS5SSSSS 

160 165 

SSS2SSS25SSS2225 

175 180 185 

gssass S-5 = s i = = = - S 



210 



GTC TCG GTT GCC TAC CAA CAT GTG CGC GAA GAC CCG ATC CCA CCT TCG 
Val Ser Val Ala Tyr Gin His Val Arg Glu Asp Pro lie Pro Pro ser 



225 



157 



15 

GAT 205 



SSSSSS £ S SS S S S5 S 

240 245 

S S 3i.S 5 E SS S IS S S S S ■ S S 



253 301 349 397 445 493 541 589 637 685 733 781 829 877



255 260 



55 
Arg Ala Asp Leu Val Arg Val H± C n*^ GST GAG CCG CCC GAG GCG CCC 
270 P ^1 Val *" Asn Gay Glu Pro Pro Glu Ala Pro 

280 285 

s s = s 5 s a s b = = 2; a is s K - 

295 300 



310 315 
325 330 

S SS E S 2 22- « a ST S 32 S £ 



345 



ATC AAC ACG TTC GGC GGC ATC ficc rmn ^ 

lie Asn Thr Phe Gly Gly lie Thr ^ 77 ^ GTT CCC ^ GTT 

350 Y 116 Thr Val Gin Val Pro Asp Val 



20 ,30;5 360 



365 



425 

455 460 



475 

BSS52SS222-5---- 

45 485 490 

S5522SSS52SS5SE2 



505 



515 520 525 



1021 1069 1117 1165 1213



552S5S5S2SS2SJS-- 

375 380 
330 335 

405 410 



1309 1357 1405 1453 1549 1597 1693
530



535 



540 



OOC ACC ACA GTT CCG GT C GAT « « « GAA CTA CAG «• «P 
Gly Thr Thr Val Pro Yal Asp Ser Val lie e±u ^ 
545 . 550 

„ m nrr r n C CTA TC C GGC ATG TTC TGG GTC GAC 

« AAC CAA TTC GTC ATG CCC GAC CTA TC^ ^ ^ ^ ^ ^ ^ 

Gly Asn Gin Phe val Met fro **= * 
560 565 

575 580 

S S S « 5 S If, Sj K Si S = = - 3 £ 

590 595 

S SS55SSS 

610 615> 
CTG AGG TTC GGC CAG TAG CTTCGCCCGC GCTAGCGGTT GAAGTTGGGA 
Leu Arg Phe Gly Gin * 
625 

AGTCCGGTGA GATGGCGGTG AGCACTTCGT TTTCCAGCCG ACGTACCRGG GTGTCGTCTT 
GCGTCCATCC GCAGCAGGTG AGCCAGTTGG CCAGTATGCG GTGCCCGCCC TCGGTGAGAA 
TCGACTCCGG ATGGAACTGG ACACCGTGGA TCGGCAGCCC GGTGTGCTGC ACGGCCKTGA 
TCACACCGCT GCTAGTGCGG GCCGTGACCC TCAGCACCGC TGGCAGCGAC TTAGGCAGAA 
TTTTCAACGA ATGGTATCGA GTGGCCGTGA AGGGfiTCCGG. BAGCCCTTGT AGCACACCGA 
CATTGGTGTG GAATACGCTG CTGGTCTTGC CGTGCAATAG CTCGGGCGCA CGGTCCACGG 
TGGCGCCGAA CGCAACGCCG ATGGCTTGGT GCCCAAGGCA GACCCCCAGC AAAGGGGTGT 
GTGCTGCCGC ACACGCGTGC ACGATACTCA CCGACGCGCC CGCGCGCTCC GGGGTAC CC G 
GACCGGGACT GAGCAGGACA CCGTCGAATT GGCCGGCGAC GGCGGCCTCR TCGGATAACC 
^WTCGTC GTTGCGCCAC ACCTCGGCCT CGATGCCGAG CTGGCCGAGG TACTGCACCA 
GGTTGARCAC GAAGCTGTCG TAGTTGTCAA CGACCAGGAT CCGCATCGTG ACAGGCTACC 
GCCTGACCCT GCCGGCAGTC GTTCAGTAGT CCAAGGGGCC TGCAGGCTGC GCGAAGTGCA 
TCCGGAC 



1741 1789 1837 1885 1933 1981 2041 2101 2161 2221 2281 2341 2401 2461 2521 2581 2641 2701 2708



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 627 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Thr Thr Pro Ser His Leu Ser Asp Arg Tyr Glu Leu Gly Glu Ile

1 5
Leu Gly Phe Gly Gly Met Ser Glu Val His Leu Ala Arg Asp Leu Arg
Leu His Arg Asp Val Ala Val Lys Val Leu Arg Ala Asp Leu Ala Arg
Asp Pro Ser Phe Tyr Leu Arg Phe Arg Arg Glu Ala Gln [...] [...] Ala
Ala Leu Asn His Pro Ala Ile Val Ala Val Tyr Asp Thr Gly Glu Ala



75 



80 



Glu Thr Pro Ala Gly Pro Leu Pro Tyr lie Val Mat Glu Tyr Val Asp 

Gly val Thr Leu Arg Asp He Val His Thr Glu Gly Pro Met Thr Pro 

105 110 

Lys Arg Ala lie Glu Val He Ala Asp Ala Cys Gin Ala Leu Asn Phe 

120 125 

ser His Gin Asn Gly iXe lie His Arg Asp Val Lys Pro Ala Asn He 

i33 140 
»et Xle ser Ala Thr Asn Ala Val Lys val Met Asp P he Gly Ile Ala 

155 160 
Arg Ala lie Ala As^ Ser Gly Asn Ser Val Thr Gin Thr Ala Ala Val 

170 175 
Xle «ly Thr Ala Gin Tyr Leu Ser Pro Glu Gin Ala Arg Gly Asp Ser 

185 190 
val Asp Ala Arg ser Asp Val Tyr Ser Leu Giy C ys Val Leu Tyr Gi u 

200 205 

val Leu Thr Gly Glu Pro Pro Phe Thr Gly Asp ser Pro Val ser Val 

-215 • 220 



Ala Tyr Gin His Val Arg Glu Asp Pro Ile Pro Pro Ser Ala Arg 



His 
240 



Glu Gly Leu Ser Ala Asp Leu Asp Ala Val Val Leu Lys Ala Leu Ala 

25 " 255 
Lys A»n Pro Glu Asn Arg Tyr Gin Thr Ala Ala Glu Met Arg Ala Asp 

265 270 
Leu val Arg val Hi, Asn Gly Glu Pro Pro Glu Ala Pro Lys Val Leu 

280 285 
Thr Asp Ala Glu Arg Thr Ser Leu Leu Ser Ser Ala Ala Gly Asn Leu 

295 300 
Ser Gly Pro Arg Thr Asp Pro Leu Pro Arg Gin Asp Leu Asp Asp Thr 



315 



320 



Asp Arg Asp Arg Ser He Gly Ser Val Gly Arg Trp Val Ala Val Val 

Ala val Leu Ala Val Leu Thr Val Val Val Thr xle Ala Xle Asn Thr 

345 350 
>he Gly Gly xle Thr Arg Asp Val Gin Val Pro Asp Val Arg Gly G Xn 

365 
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»i. ai a He Ala The Leu Gin Asn Arg Gly Phe Lys He 

Ser Ser Ala Asp Ala lie Ma - 38Q 

370 373 
^ Thr Leu Gin Lys Pro Asp Ser Thr He Fro Pro Asp His Val lie 
385 3 ^ 

el y Thr Asp Pro Ala Ala Asn Thr Ser Val Ser Ala Gly Asp Glu He 



405 



Thr val Asn val Ser Thr Gly Pro Glu Gin Arg Glu He P~ Asp Val 



420 



-.x. m , 1a r1n Ala Val Lys Lys Leu Thr Ala Ala Gly 
Ser Thr Leu Thr Tyr Ala Glu Ala vai x.ys y ^ 

435 440 
Ph e Gly Ar* Phe Lys Gin Ala ^ ser Pro ser Thr Pro Glu Leu val 



450 



G ly Lys val He Gly Thr Asn Pro Pro Ala Asn Gin Thr Ser Ala He 



465 



Thr Asn val val lie He He Val Gly Ser Gly Pro Ala Thr Lys 



485 



XXe Pro Asp Val Ala Gly Gin Thr £X Asp Val Aia Gin Lys Asn Leu 



500 



M „ val Tyr Gly Phe Thr Lys Phe ser Gin Ala Ser Val Asp Ser Pro 

515 520 
^ Pro Ala Gly Glu val Thr Gly Thr Asn Pro Pro Ala Gly Thr Thr 

530 535 
Val Pro Val Asp Ser Val Ile Glu Leu Gln Val Ser Lys Gly Asn Gln
545 550 

Phe Val Met Pro Asp Leu Ser Gly He* P^e Trp Val Asp Ala Glu Pro 

Leu Arg Ala Leu Gly Trp Thr Gly Met Leu Asp Lys Gly Ala Asp 
580 585

Val Asp Ala Gly Gly Ser Gln His Asn Arg Val Val Tyr Gln Asn Pro
595 600 



Pro Ala Gly Thr Gly Val Asn Arg Asp Gly Ile Ile Thr Leu Arg Phe
610 615 



Gly Gln *
625 

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Leu Gly Cys Gly Gly Arg Arg Ala Arg Cys Ala Asp Arg Arg Gly

1 5

His Arg His Gln His Val Arg Arg His His Pro Arg Arg Ser Ser Ser

25 30



Arg Arg Ser Gly Ser Ile Leu Arg Arg Arg His Arg His Thr Ala Lys

Pro Gly Leu Gin Asn Pro His Leu Ala G lu Ala Gly Asp Asa Pro 

55 60
Thr Gly Pro Arg Tyr Arg His Arg Pro Gly Arg Gln His Val Gly Glu

75 80 
Cys Arg Arg Arg Asp His Ser Gln Arg Val His Arg Thr Gly Ala Thr

90 95
Arg Asn Thr Arg Arg Leu His Ala Asp Ile Arg Arg Ser Gly Gln Glu

110



Thr Asp Cys Arg Arg Ile Arg Pro Leu Gln Ala Ser Glu Phe Ala Val

120 125 

His Pro Gly Thr Gly Gly Gln Gly His Arg Asp Gln Pro Ala Ser Gln

135 140 
Pro Asp Val Gly His His Gln Cys Gly His His His Arg Trp Leu Trp

155 160 
Ser Gly Asp Gln Arg His Ser Arg Cys Arg Gly Pro Asp Arg Arg Arg

170 175 
Gly Ala Glu Glu Pro Gln Arg Leu Arg Leu His Gln Ile Gln Ser Gly

185 190 
Leu Gly Gly Gln Pro Pro Ser Arg Arg Arg Gly Asp Arg His Gln Ser

200 205 
Thr Arg Arg His His Ser Ser Gly Arg Phe Ser His Arg Thr Thr Gly 

Val Gin Gly G ln Prp lie Arg His Ala Arg Pro He Arg His Val Leu 

235 240 
Gly Arg Arg Arg Thr Thr Ile Ala Arg Ala Gly Leu Asp Arg Asp Ala

250 255 
Arg Gln Arg Gly Arg Arg Arg Arg Arg Trp Leu Pro Thr Gln Pro Gly



270 



Arg Leu Ser Lys Pro Ala Gly Gly Asp Arg Arg Gln Pro Gly Arg His

285 

His His Ala Glu Val Arc Pro Val ai a e-.. « 

290 9 1 Ser Pro teu Ala Val Glu 

295 300 

Val Gly Lys Ser Gly Glu Met Ala Val Ser Thr Ser Phe Ser Ser Arg

315 320 
Arg Thr Arg Val Ser Ser Cys Val His Pro Gln Gln Val Ser Gln Leu

335 

Ala Ser Met Arg Cys Pro Pro Ser Val Arg Ile Asp Ser Gly Trp Asn

345 350 
Trp Thr Pro Trp Ile Gly Ser Pro Val Cys Cys Thr Ala Met Ile Thr

360 365 
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.„ M « v., „ - s - - « - a « s * r w " u 
„ «I ... ». - - - - « « « " l - " y s " a 
~ - c. - a » ~ - « a - rat - ~ » - 

405 

«,w v«i Ala Pro Asn Ala Thr 
Pro cys Asn ser ser Sly Ala Arg ser Thr V.l Ala ^ 

^ Br . rt e-r lvs Gly Val Cys Ala 
Pro Met Ala Trp Cys *ro Arg Gin Tnr Pro Ser I* ^ 
435 

„. ». «. ~. «. - a « - - "* s "* w 8 " oly 

450 

« „ « » •« - ~ - - ~ s - TO ~ ** ~ 

„ _ -r ser ^ Arg His Thr Ser Ala 
Ala Ala ser Ser Asp Asn Arg Trp Ser Ser Leu Arg 495 
485 

m r „ c Thr L eu Asn Thr Lys Leu 

ser Her Pro Ser Trp Pro Arg Tyr cys Thr Arg ^ 



Ser 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Lys Gly Ala Asp Val Asp Ala Gly Gly Ser Gln His Asn Arg

1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

«, cm Pro Pro fer ^ ^ ^ Giy p ^ ^ t ^ 

Arg 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GCGGGCGTT TCGGCTTCAC C 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CAGGGAATTC GAGCTCGGTA CCCGGGGAGG CCTGCACGTG ATCGAGGGCA GACACCATCA 60
CCACCATCAC TAATAGTTAA TTAACTGCAG GGATCCTCTA GAGTCGACCG GCATGCAAGC 120
TTAAGTAAGT AA 132

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 133 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)






(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CAGCCGAATT CGAGCTCGGT ACCCGGGGAG GCCTGCACGT GATCGAGGGC AGACACCATC 60
ACCACCATCA CTAATAGTTA ATTAACTGCA GGGATCCTCT AGAGTCGACC GGCATGCAAG 120
CTTAAGTAAG TAA 133

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 134 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



60 
120 
134 

GCTTAAGTAA GTAA




Claims 






of the B2 and/or B3 protein, or its homologues.
with (x) and (y) representing any of the following values:

- from position (71) to position (1951) 

- from position (398) to position (1951) 

- from position (1063) to position (2604) 

- from position (272) to position (1438) 

- from position (1439) to position (1951) 
- from position (1438) to position (2604)

(HQ the amino acid 2S£ Mend^T ^ reSented by SEQ ,D NO 3 
M «** amino ac W ^ 5 !l3&*° n , (626) h SEQ ,D N ° * 

M *• amino acid sequence extending from SSSfS to^5" ^ h SEQ ,D N0 2 - 
N the am,no acid sequence extending taSSS!^* 1 (626) in SEQ 10 N ° 2. 
- culturing a transformed cellular host according to claim 5 under conditions allowing the heterologous expression of the polypeptide or the fusion protein which is encoded,
- isolating the polypeptide or fusion protein from the culture broth.

tuberculosis antibodies which may be present in said sample.

ESSE S&Sl. — — - — * - - " 

antigens which may be present in said sample, 
claim 10. 

being combined with a pharmaceutically acceptable carrier.